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Microprocesso r-Controlled Full-Duplex Speakerphone 
Using Automatic Gain Control 



Background of the Invention 

The invention relates to wireless speakerphones, and more particularly, to a microprocessor- 
controlled full-duplex speakerphone using automatic gain control. There are two basic types of 
speakerphones available on the market today: A lower-cost, half-duplex design aimed at the consumer 
5 market, and an expensive full-duplex DSP implementation for business applications. The major 
technical obstacle to overcome in designing a speakerphone is the prevention of unstable feedback 
(howling, or squealing) caused by adjusting the speaker and/or microphone gains too high. The first 
solution to this problem was the half-duplex speakerphone. 

In the receive direction (the far-end person is heard via the speaker), it is obviously desirable 

10 to provide a relatively large gain on the speaker, but due to the proximity of the microphone to the 
speaker in the speakerphone enclosure, the microphone will detect the far-end person's voice and 
amplify it back to the far-end. This acoustic coupling is the source of half of the feedback loop in the 
speakerphone, and results in an annoyingly high level of sidetone in the far-end handset. To mitigate 
this acoustic coupling, the half-duplex speakerphone reduces the gain of the microphone to its 

15 minimum when the far-end person is talking, so that none of the far-end person's voice is returned 
back. 

In the transmit direction (the near-end person speaks into the microphone), it is obviously 
desirable to provide a relatively large gain on the microphone to allow greater distances between the 
person speaking and the microphone. However, due to the electrical connection of the microphone to 

20 the telephone lines (via the 2-wire to 4-wire hybrid interface), a part of the transmitted voice signal 
is reflected back into the RX speech path, which is then amplified by the speaker driver with the result 
that the near-end person's own voice is amplified into the room. This hybrid sidetone is the second 
half of the feedback loop in the speakerphone, and is the natural way typical telephone handsets 
provide sidetone from microphone to the earpiece (the microphone gain contributes to the sidetone 

25 level). To mitigate this hybrid sidetone, the half-duplex speakerphone reduces the gain of the speaker 
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to its minimum when the near- end person is talking, so that none of the near-end person's voice is 

amplified into the same room. 

Whenever the microphone and speaker gains are not balanced in this "see-saw" 

minimum/maximum way, the familiar acoustic feedback sound (howling, squealing) can easily result 
5 from the completed feedback loop provided by the acoustic coupling and hybrid sidetone audio paths. 

This "see-saw" gain adjustment process requires the speakerphone to determine which person is 

talking, and it must arbitrate the two signal paths accordingly. These functions are typically provided 

by an expensive analog voice-switched speakerphone chip, but the arbitration typically suffers from 

several basic disadvantages: 
10 The speakerphone gives priority to the loudest person speaking when both people are 

attempting to speak simultaneously, for example, when one person is trying to interrupt the other. 

This is a disadvantage for the case of a weak signal from a distant phone, or from a person who is not 

sitting very close to the speakerphone. In these cases it may be necessary for the far-end person to 

unnaturally shout into the handset, or for a person at the far end of the table to temporarily move 
15 closer to the speakerphone. 

The slow switching time during this volume comparison usually results in the loss of a few 

syllables at the beginning of the interruption, which generally results in the person having to repeat 

the whole sentence. 

The full-duplex DSP-based speakerphone implements robust signal cancellation of the two 

20 speech paths to eliminate the coupling of the two channels. As a result, the microphone and speaker 
gains can be maintained at high levels throughout the conversation, thus eliminating the voice 
switching altogether. However, this high quality demands a high price because 1) it requires a 
powerful DSP engine capable of performing these calculations on both speech paths at the 8kHz 
sample rate, and 2) the algorithm requires considerable DSP and audio experience. 

25 The user-controlled volume setting is typically implemented in hardware in an analog 

speakerphone via a potentiometer, but suffers from degraded audio quality over time due to dust in 
the potentiometer mechanism and DC offset drift. Typical cordless phone designs house the 
speakerphone function in the base station. The disadvantage of this configuration is the obvious one: 
the speakerphone functions are limited by the length of the wires connecting the base station to the 

30 telephone jack and to the power. 

In view of the foregoing, what is needed is a speakerphone which allows a full-duplex 
conversation, i.e.: simultaneous speaking and hearing, without an external analog speakerphone chip 
and a DSP engine, without external analog decoders with resistor ladders for providing gain, and 
without using a potentiometer for user-controlled volume control. Furthermore, the speakerphone 

35 functions should be housed in the portable handset. 

Summary of the Invention 
A near full duplex portable handset speakerphone comprises: a microprocessor; a hands-free 
receive register connected to the microprocessor; a hands-free transmit register connected to the 



- 2 - 




TT1220 

microprocessor; a ROM having a speakerphone operation algorithm, the ROM connected to the 
microprocessor; a first analog-to-digital converter connected to the hands-free receive register; a second 
analog-to-digital converter connected to the hands-free transmit register; a first programmable digital 
attenuator connected to the microprocessor and to a speaker; and a second programmable digital 
5 attenuator connected to the microprocessor and to a microphone, wherein near full duplex 
communication is achieved without digital signal processing. 

In another feature of the invention, the hands-free registers provide a digital representation 
of the speech volume in each direction to the microprocessor. The microprocessor monitors the speech 
signal levels, calculates digital volume comparisons in order to make speech gain decisions for optimal 
10 sound, and digitally adjusts the gains in the two speech paths to the upper half of their maximum 
values. 

Brief Description of the Drawings 

Fig. 1 is a block diagram of a wireless speakerphone system 10 built according to the present 
invention. 

15 Fig. 2 is a diagram of the state machine 60 of the speakerphone algorithm 41, showing timer- 

controlled state transitions. 

Fig. 3 is a flow chart of the algorithm for the peak detector 70 of the speakerphone algorithm. 
Fig. 4. is a flow chart of the algorithm for the idle state 68 of the speakerphone algorithm. 
Fig. 5 is a flow chart of the algorithm for the RX state 62 of the speakerphone algorithm. 
20 Fig. 6 is a flow chart of the algorithm for the TX state 64 of the speakerphone algorithm. 

Fig. 7 is a flow chart of the algorithm for the full-duplex state 66 of the speakerphone 
algorithm. 

Fig. 8 is a flow chart of the RX quantification routine 190 of the speakerphone algorithm. 
Fig. 9a is a flow chart of the RX gain adjustment routine 230 of the speakerphone algorithm. 
25 Fig. 9b is a flow chart of the RX AGC 231 of the speakerphone algorithm. 

Fig. 10 is a state diagram and table showing the substates within the full-duplex state 66. 
Fig. 11 is a flow chart of the full-duplex substate initialization routine of the speakerphone 
algorithm. 

Fig. 12 is a diagram showing the mapping of the RX and TX volume levels. 
30 Figs. 13a and 13b are flow charts of the routine by which the mapping of Fig. 12 is 

determined. 

Fig. 14 is block diagram of a first alternate embodiment, having the speakerphone in the base 

station. 

Fig. 15 is block diagram of a second alternate embodiment. 
35 Description of the Preferred Embodiment 

In Fig. 1 a wireless speakerphone system 10, built according to the present invention, uses a 
pair of integrated circuit controller chips 12, 14 to provide a digital wireless voice link between a 
portable handset 16 and a base station 18. A signal 20 from a far-end telephone 22 is received by a 
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PBX or a telephone central office 24. The office 24 transmits the signal 20 to the base station 18. In 
addition to the controller chip 12 mentioned above, the base station 18 also includes a hybrid 26, which 
is a telephone line interface (a 2-wire to 4-wire hybrid interface). The controller chip 12 includes a 
codec 28, which is a coder/decoder of signals. The codec 28 serves as the analog interface to the 
5 telephone line. The base station 18 further includes an RF 30, which is a radio frequency interface. 

Referring now to the portable handset 16, it includes the speakerphone functionality, rather 
than having such functionality included in the base station 18. The user makes the usual decision to 
enable either the handsets earpiece and microphone (not shown in Figl), or the "hands-free" 
speakerphone interface (consisting of blocks 50, 44, 48, 42 in Fig. 1) for the telephone conversation. 

10 A codec 32 in the controller chip 14 handles the analog speakerphone interface. The controller chip 
14 further includes an embedded hands-free receive register RX 34 and an embedded hands-free 
transmit register TX 36 in its speech paths. 

A microprocessor uP 38 controls the functioning of the controller chip 14. A read-only memory 
ROM 40 houses a speakerphone algorithm 41, not shown. A microphone Mic 42 picks up the speech 

15 of the user, and a speaker 44 delivers the far-end user's speech to the user of the portable handset 16. 

A pre-amplifier PRE 46 provides programmable gain of either +3dB or +18dB. The amplifiers AMP 
48, 50 external to the controller chip 14 are analog amplifiers. The AMP amplifiers (RX attenuation 
register 52 and TX attenuation register 54) internal to the controller chip 14 are programmable digital 
attenuators providing OdB to -42dB gain, and mute. The portable handset 16 also includes a radio 

20 frequency interface RF 56. 

Although hands-free registers exist in both of the identical controller chips 12 and 14, hands- 
free registers in the controller chip 12 are not used, because the speakerphone algorithm executes 
solely in the portable handset 16. 

The speakerphone algorithm 41 includes three sequential tasks performed by the on-chip pP 

25 38: reading the hands-free registers and determining the peak volume levels of both speech paths; 

executing a speakerphone state machine 60 (shown in Fig. 2); and digitally adjusting the microphone 
and speaker gains as directed by the speakerphone state machine. The speakerphone algorithm 41 
uses timers and peak detection as its two basic pillars. The timers are the first basic pillar which 
forms the foundation of the speakerphone algorithm 41. The timers fall into three categories: a) a 

30 125us frame timer or variable, b) a 20ms state machine variable, and c) a 160ms hold variable and a 
80ms duplex variable. The 125us frame variable is the only variable which is implemented in 
hardware. The 125us frame variable generates a hardware interrupt to the uP 38 on every speech 
frame so that one of the hands-free registers 34, 36 can be read by a software peak detector. 

The 20ms state machine variable is a RAM Sample_Counter variable which is implemented 

35 in the interrupt service routine for the 125us frame variable. The RAM Sample_Counter variable is 
incremented by one each time the interrupt service routine is called (every 125us), and when it reaches 
160 (20ms), the value is cleared and the Do_HF boolean flag is set. This flag is polled by the main() 
wireless telephone control program, and when set, causes execution of the hands-free speakerphone 
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algorithm 41. Thus the speakerphone algorithm is executed once every 20ms, which means that the 
peak detection window is 20ms, and the speakerphone state machine 60 either remains in the same 
state or advances to a new state every 20ms. 

The 160ms hold variable is a software timer which is implemented inside the speakerphone 
5 state machine 60 by the use of a RAM Hold_Time variable. The RAM Hold_Time variable holds the 
state machine 60 in the current state before it enters the idle state, i.e.: it adds a delay between the 
active RX, TX, or full-duplex states and the idle state. This empirically-derived delay prevents the 
state machine 60 from jumping between the active states and the idle state during the short quiet gaps 
and pauses in normal speech. The Hold__Time variable is initialized to 160ms upon entry into the RX, 

10 TX, and full-duplex states, and is used in conjunction with the duplex variable. 

The 80ms software duplex timer is implemented inside the speakerphone state machine 60 by 
the use of a RAM Duplex JTime variable. The primary purpose of this variable is to prevent an abrupt 
change in duplex when both people are talking at (nearly) the same time. For example, it would be 
very undesirable for the speakerphone to oscillate between the RX and TX states when both people are 

15 talking. Instead the duplex variable holds the state machine 60 in the full-duplex state during the 
short quiet gaps and pauses in both people's normal speech. The Duplex_Time variable is initialized 
to 80ms upon entry into the RX, TX and full-duplex states, and is used in conjunction with the hold 
variable as follows. 

Referring now to Fig. 2, the state machine 60 is a software routine which has memory of its 

20 past state, the current state, and its future states by means of state variables stored in RAM. The 
state machine 60 makes its decisions by executing. software instructions. Electronic state machines 
are either typically implemented in hardware (by flip-flops or latches) or in software (by a routine 
which operates on state variables stored in RAM). In the preferred embodiment, the state machine 
60 is implemented in software. The inputs to the speakerphone state machine 60 Eire comprised of the 

25 peak volume levels of both speech paths together with the current microphone and speaker gain 
settings. The speakerphone state machine 60 compares the peak volume levels of both speech paths 
to pre-defined threshold levels, monitors the current microphone and speaker gain settings, and finally 
determines the optimum gain settings for the present volume levels. The speakerphone state machine 
60 consists of 4 operating states: a RX 62, a TX 64, a full-duplex 66, and idle state 68. 

30 Fig. 2 shows how the Hold_Time variable and the Duplex_Time variable are used in 

conjunction to provide the state transition delays. Specifically, the hold and duplex variables are 
initialized to 160ms and 80ms respectively by loading the Hold_Time variable with a value of 8, and 
by loading the Duplex_Time variable with a value of 4. On each execution pass of the speakerphone 
state machine 60 (every 20ms), both of these variables are either reloaded with their initial values, or 

35 one of them decremented by 1, depending on the decision made by the state machine 60. If the state 
machine 60 detects sufficient volume in the RX and/or TX speech paths to enter or to remain in one 
of the active states (RX 62, TX 64, or full-duplex 66), both variables are reloaded with their initial 
values. If insufficient volume is present in the RX speech path, the duplex variable is decremented 
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by 1 in preparation for the pending state transition to the TX state 64. When the duplex variable is 
decremented to 0 (after 4 passes), the state machine 60 enters the TX state 64 and both variables are 
initialed. If insufficient volume is present in the TX speech path, the duplex variable is decremented 
by 1 in preparation for the pending state transition to the RX state 62. When the duplex variable is 
5 decremented to 0 (after 4 passes), the state machine 60 enters the RX state 62 and both variables are 
initialed. If insufficient volume is present in both speech paths, the state machine 60 will enter an 
idle state 68 after both variables are decremented to 0: the duplex variable is decremented to 0 first 
(after 4 passes), then the hold variable is decremented by 1. When the hold variable is also 
decremented to 0, twelve total execution passes of the state machine 60 have elapsed, and the state 

10 machine 60 enters the idle state 68. 

The peak detector algorithm 70 is the second basic pillar which forms the foundation of the 
speakerphone algorithm 41. The most basic piece of information needed by the speakerphone 
algorithm 41 is the relative volume of the two speech paths. The transmit and receive speech paths 
of the controller 14 are conveyed by the codec 32, whose sample rate is the standard frame rate of 

15 8kHz, so the hands-free registers 34, 36 are updated with fresh values every 125ps. Because these 
registers return the current digital magnitude of the two speech paths at the instant they are read, 
a software peak detector algorithm 70 is necessary to determine the maximum signal level during a 
given time. 

Referring now to Fig. 3, as previously described, in step 72 the 125us frame variable generates 

20 the interrupts which cause the on-chip pP 38 to read one of the hands-free registers 34, 36 during the 
frame variable interrupt service routine. In step 74 a boolean flag HF_Toggle is used to keep track 
of which hands-free register to read during alternating passes. The hands-free register RX 34 is read 
during one pass, and the hands-free register TX 36 is read during the next, so the sample rate for both 
registers is 250us. If HF_Toggle=l during an execution of the interrupt service routine, then in step 

25 76 HF_Toggle is set equal to zero. In step 78 the pP 38 reads the hands-free register RX 34, and in 
step 80 compares the value to the saved (peak) value stored in the RAM variable RX_Peak. If the 
fresh value is greater than the saved value, then in step 82 the fresh value is stored in RX_Peak for 
future use. In step 84, the fresh value is discarded if it is less than or equal to the saved peak value. 
Likewise, if HF_Toggle=0 during an execution of the interrupt service routine, then in step 86 

30 HF_Toggle is set equal to one. In step 88 the pP 38 reads the hands-free register TX 36, and in step 
90 compares the value to the saved (peak) value stored in the RAM variable TX_Peak. If the fresh 
value is greater than the saved value, then in step 92 the fresh value is stored in TX_Peak for future 
use. In step 94 the fresh value is discarded if it is less than or equal to the saved peak value. The RX 
and TX peak values are accumulated in this way over the whole Peak Detection Window, which is 80 

35 samples at 250us each, or 20ms. 

When the state machine 60 variable reaches 160 (20ms), the Do_HF Boolean flag is set, and 
the pP 38 soon executes the hands-free state machine 60 which first copies the current values of 
RX_Peak and TX_Peak to separate RAM locations called RX_Max and TX_Max respectively (because 
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RX_Peak and TX_Peak are continually updated by the frame variable interrupt service routine). These 
separate RAM locations are used for all subsequent volume level comparisons within the speakerphone 
algorithm 41. 

Referring now to Fig. 4, after the telephone call is made and the connection is established, the 
5 speakerphone in step 100 begins in the idle state 68 because both speech paths are quiet, because 
neither person has started talking yet. This "quiet level" is defined in software as a background noise 
level threshold, and an independent threshold is assigned for the speakerphone-side and the far-end 
environments. As shown in Fig. 2, and in the idle state 68 flow chart in Fig. 4, the state machine 60 
decides during the current cycle whether to remain in the idle state 68, to enter the RX state 62, or 

10 the TX state 64, when the current cycle completes. 

In step 102 the uP 38 checks to see if the volume level received from the far-end (RX_Max) 
rises above the pre-defined background noise level (RX_Noise_Thresh). If so, the speakerphone state 
machine 60 assumes the far-end person has just begun to speak. If this volume level is greater than 
the near-end volume level (TX_Max), then the speakerphone state machine 60 in step 104 will enter 

15 the RX state 62 when the current cycle completes. If both of these conditions are not met, then in step 
106 the state machine 60 focuses on the TX speech path. If the near-end volume level (TX_Max) rises 
above the pre-defined background noise level (TX_Noise_Thresh), the speakerphone state machine 60 
assumes the near-end person has just begun to speak. If this volume level is greater than the far-end 
volume level (RX_Max), then the speakerphone state machine 60 in step 108 will enter the TX state 

20 64 when the current cycle completes. If both of these conditions are not met, then in step 110 the 
speakerphone state machine 60 remains in the idle state 68 until the next cycle, when the volume 
levels will be analyzed again. 

Referring now to Fig. 5, the RX state 62 is defined from the speakerphone's point of view as 
the state when the far-end person is talking but the person near the speakerphone is not talking. As 

25 shown in Fig. 2 and in Fig. 5, the state machine 60 decides during the current cycle whether to remain 
in the RX state 62, or to enter the TX state 64, the full-duplex state 66, or the idle state 68 when the 
current cycle completes. 

Step 112 is the entry point for the RX state 62 algorithm. In step 114, if the volume level 
received from the far-end (RX_Max) is still greater than the pre-defined background noise level 

30 (RX_Noise_Thresh), the speakerphone state machine 60 assumes the far-end person is still speaking. 

Then in step 116, if the near-end volume level (TX_Max) exceeds the volume level received from the 
far-end (RX_Max) by a dynamic "both" level (Both_Thresh), the speakerphone state machine 60 
assumes the near-end person has just begun to speak in addition to the far-end person, and in step 
118 it will enter the full-duplex state when the current cycle completes. (Both_Thresh is detailed in 

35 a later section.) If the RX volume level exceeds the noise threshold, but insufficient TX volume is 
detected, then in step 120 the speakerphone state machine 60 remains in the RX state 62 until the 
next cycle, when the volume levels will be analyzed again. 
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If the RX volume level does not exceed the noise threshold, then in step 122 the state machine 
60 focuses on the TX Speech Path. In step 122, if the TX volume level rises above the pre-defined 
background noise level (TX_Noise_Thresh), the speakerphone state machine 60 assumes the near-end 
person has just begun to speak and the far-end person has stopped. If the Duplex Variable has been 
5 decremented to 0, then in step 124 the speakerphone state machine 60 will enter the TX state 64 when 
the current cycle completes. If not, then in steps 126, 128 and 120 the duplex variable is decremented 
by 1 and the speakerphone state machine 60 remains in the RX state 62 until the next cycle. Thus 
the duplex variable imposes an 80ms transition delay from the RX state 62 to the TX state 64. The 
state delays are implemented by executing multiple passes through the state machine. 
10 The 80ms delay from RX to TX state goes through the following steps: 

step 112 
step 114: no 

step 122: no, TX_Max is greater, but Duplex_Time = 4 (initial value) 
step 126: no 
15 step 128: Duplex_Time = 3 
step 120 

(wait 20ms, re-execute state machine) 
step 112 
step 114: no 

20 step 122: no, TX_Max is greater, but Duplex_Time = 3 
step 126: no 

step 128: Duplex_Time = 2 
step 120 

(wait 20ms, re-execute state machine) 
25 step 112 

step 114: no 

step 122: no, TX_Max is greater, but Duplex_Time = 2 
step 126: no 

step 128: DuplexJTime = 1 
30 step 120 

(wait 20ms, re-execute state machine) 
step 112 
step 114: no 

step 122: no, TX_Max is greater, but Duplex_Time = 1 
35 step 126: no 

step 128: Duplex_Time = 0 
step 120 

(wait 20ms, re-execute state machine) 
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step 112 
step 114: no 

step 122: yes, TX_Max is greater, Duplex_Time = 0 
step 124: exit to TX state 

5 If neither the RX volume nor the TX volume level exceeds their respective noise thresholds, 

the speakerphone state machine 60 delays a total of 240ms (12 passes) before entering the idle state 
68. This is accomplished in steps 126 to 134 by decrementing the duplex and hold variables during 
successive cycles of the speakerphone state machine 60. 

The 240ms delay from RX to Idle state goes through the following steps: 
10 step 112 

step 114: no 

step 122: no, TX_Max not greater, and Duplex_Time = 4 (initial value) 
step 126: no 

step 128: Duplex_Time = 3 
15 step 120 

(wait 20ms, re-execute state machine) 
step 112 
step 114: no 

step 122: no, TX_Max not greater, Duplex_Time = 3 
20 step 126: no 

step 128: Duplex_Time = 2 
step 120 

(wait 20ms, re-execute state machine) 
step 112 
25 step 114: no 

step 122: no, TX_Max not greater, Duplex_Time = 2 
step 126: no 

step 128: Duplex_Time = 1 
step 120 

30 (wait 20ms, re-execute state machine) 
step 112 
step 114: no 

step 122: no, TX_Max not greater, Duplex_Time = 1 
step 126: no 
35 step 128: Duplex JTime = 0 
step 120 

(wait 20ms, re-execute state machine) 
step 112 
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step 114: no 

step 122: no, TX_Max not greater, Duplex_Time = 0 
step 126: yes 

step 130: no, Hold_Time still = 8 (initial value) 
5 step 132: Hold_Time = 7 
step 120 

(wait 20ms, re-execute state machine) 
step 112 
step 114: no 

10 step 122: no, TX_Max not greater, Duplex JTime = 0 
step 126: yes 

step 130: no, HoldJTime = 7 
step 132: Hold_Time = 6 
step 120 

15 (wait 20ms, re-execute state machine) 
step 112 
step 114: no 

step 122: no, TX_Max not greater, DuplexJTime = 0 
step 126: yes 
20 step 130: no, HoldJTime = 6 
step 132: Hold_Time = 5 
step 120 

(wait 20ms, re-execute state machine) 
step 112 
25 step 114: no 

step 122: no, TX_Max not greater, Duplex_Time = 0 
step 126: yes 

step 130: no, Hold_Time = 5 
step 132: Hold_Time = 4 
30 step 120 

(wait 20ms, re-execute state machine) 
step 112 
step 114: no 

step 122: no, TXJMax not greater, DuplexJTime = 0 
35 step 126: yes 

step 130: no, HoldJTime = 4 
step 132: HoldJTime = 3 
step 120 



- 10 - 




TT1220 

(wait 20ms, re-execute state machine) 
step 112 
step 114: no 

step 122: no, TX_Max not greater, Duplex_Time = 0 
5 step 126: yes 

step 130: no, Hold_Time = 3 
step 132: Hold.Time = 2 
step 120 

(wait 20ms, re-execute state machine) 
10 step 112 

step 114: no 

step 122: no, TX_Max not greater, Duplex_Time = 0 
step 126: yes 

step 130: no, Hold_Time = 2 
15 step 132: Hold.Time = 1 
step 120 

(wait 20ms, re-execute state machine) 
step 112 
step 114: no 

20 step 122: no, TX_Max not greater, Duplex.Time = 0 - 
step 126: yes 

step 130: no, Hold_Time = 1 
step 132: Hold_Time = 0 
step 120 

25 (wait 20ms, re-execute state machine) 
step 112 
step 114: no 

step 122: no, TX_Max not greater, Duplex_Time = 0 

step 126: yes 
30 step 130: yes 

step 134: exit to Idle State 

Referring now to Fig. 6, the TX state 64 is defined from the speakerphone's point of view as 

the state when the person near the speakerphone is talking but the far-end person is not talking. As 

shown in Fig. 2 and in Fig. 6, the state machine 60 decides during the current cycle whether to remain 
35 in the TX state 64, or to enter the RX state 62, the full-duplex state 66, or the idle state 68 when the 

current cycle completes. 

Step 140 is the entry point for the TX state 64 algorithm. In step 142, if the near-end volume 

level (TX_Max) is still greater than the pre-defined background noise level (TX_Noise_Thresh), the 
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speakerphone state machine 60 assumes the near-end person is still speaking. Then in step 144 if the 
volume level received from the far-end (RX_Max) exceeds the near-end volume level, the speakerphone 
state machine 60 assumes the far-end person has just begun to speak in addition to the near-end 
person, and in step 146 it will enter the Near Full-Duplex State when the current cycle completes. 
5 If the TX volume level exceeds the noise threshold but insufficient RX volume is detected, then in step 
148 the speakerphone state machine 60 remains in the TX state 64 until the next cycle, when the 
volume levels will be analyzed again. 

In step 142, if the TX volume level does not exceed the noise threshold, the state machine 60 
focuses on the RX Speech Path. In step 150, If the RX volume level rises above the pre-defined 

10 background noise level (RX_Noise__Thresh), the speakerphone state machine 60 assumes the far-end 
person has just begun to speak and the near-end person has stopped. If the duplex variable has been 
decremented to 0, then in step 152 the speakerphone state machine 60 will enter the RX state 62 when 
the current cycle completes. If not, then in steps 154, 156, and 148 the duplex variable is decremented 
by 1 and the speakerphone state machine 60 remains in the TX state 64 until the next cycle. Thus 

15 the duplex variable imposes an 80ms transition delay from the TX state 64 to the RX state 62. 

The state delays are implemented by executing multiple passes through the state machine. The 
80ms delay from TX to RX state goes through the following steps: 
step 140 
step 142: no 

20 step 150: no, RX_Max is greater, but DuplexJTime = 4 (initial value) 
step 154: no 

step 156: DuplexJTime = 3 
step 148 

(wait 20ms, re-execute state machine) 
25 step 140 

step 142: no 

step 150: no, RX„Max is greater, but DuplexJTime - 3 
step 154: no 

step 156: DuplexJTime = 2 
30 step 148 

(wait 20ms, re-execute state machine) 
step 140 
step 142: no 

step 150: no, RX_Max is greater, but DuplexJTime = 2 
35 step 154: no 

step 156: DuplexJTime = 1 
step 148 

(wait 20ms, re-execute state machine) 
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step 140 
step 142: no 

step 150: no, RX_Max is greater, but Duplex_Time = 1 
step 154: no 

step 156: Duplex_Time = 0 
step 148 

(wait 20ms, re-execute state machine) 
step 140 
step 142: no 

step 150: yes, RX_Max is greater, Duplex_Time = 0 
step 152: exit to RX state 

If neither the TX volume nor the RX volume level exceeds their respective noise thresholds, 
the speakerphone state machine 60 delays a total of 240ms (12 passes) before entering the Idle state 
68. This is accomplished in steps 154 to 162 by decrementing the duplex and hold variables during 
successive cycles of the speakerphone state machine 60. 

Referring now to Fig. 7, the speakerphone state machine 60 enters the full-duplex state 
whenever both people are simultaneously talking. As shown in Fig. 2 and Fig. 7, the state machine 
60 decides during the current cycle whether to remain in the Full-Duplex State, or to enter the TX or 
Idle state 68 when the current cycle completes. Once the speakerphone enters this state, it remains 
here until the far-end person stops talking, in which case the speakerphone returns to the Idle state 
68. The reason for this is because it is much simpler in practice to detect that the far-end person has 
stopped talking than to detect that the near-end person has stopped talking, due to the acoustic 
coupling of the speakerphone's speaker to its microphone. 

Step 170 is the entry point for the full-duplex state 66 algorithm. In step 172, if the volume 
level received from the far-end (RX_Max) is still greater than the pre-defined background noise level 
(RX_Noise_Thresh), the speakerphone state machine 60 assumes the far-end person is still speaking, 
and therefore remains in the full-duplex state 66 until the next cycle, when the volume levels will be 
analyzed again. 

In step 172, if the RX volume level does not exceed the noise threshold, the state machine 60 
focuses on the TX Speech Path. In step 174, if the TX volume level is still greater than the pre-defined 
background noise level (TX_Noise_Thresh), the speakerphone state machine 60 assumes the near-end 
person is still speaking but the far-end person has stopped. If the duplex variable has been 
decremented to 0, then in step 176 the speakerphone state machine 60 will enter the TX state 64 when 
the current cycle completes. If not, in step 180 the duplex variable is decremented by 1, and the 
speakerphone state machine 60 remains in the full-duplex state 66 until the next cycle. Thus the 
duplex variable imposes an 80ms transition delay from the full-duplex state to the TX state 64. 
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The state delays are implemented by executing multiple passes through the state machine. 
The 80ms delay from Full-Duplex state to TX state goes through the following steps: 
step 170 
step 172: no 

5 step 174: no, TX_Max is greater, but Duplex_Time = 4 (initial value) 
step 178: no 

step 180: Duplex_Time = 3 
step 188 

(wait 20ms, re-execute state machine) 
10 step 170 

step 172: no 

step 174: no, TX_Max is greater, but Duplex_Time = 3 
step 178: no 

step 180: DuplexJTime = 2 
15 step 188 

(wait 20ms, re-execute state machine) 
step 170 
step 172: no 

step 174: no, TX_Max is greater, but Duplex_Time = 2 
20 step 178: no 

step 180: Duplex_Time = 1 
step 188 

(wait 20ms, re-execute state machine) 
step 170 
25 step 172: no 

step 174: no, TX_Max is greater, but Duplex JTime - 1 
step 178: no 

step 180: Duplex_Time = 0 
step 188 

30 (wait 20ms, re-execute state machine) 
step 170 
step 172: no 

step 174: yes, TX_Max is greater, Duplex JTime = 0 
step 176: exit to TX state 

35 If neither the RX volume nor the TX volume level exceeds their respective noise thresholds, 

the speakerphone state machine 60 delays a total of 240ms (12 passes) before entering the idle state 
68. This is accomplished in steps 178 to 184 by decrementing the duplex and hold variables during 
successive cycles of the speakerphone state machine 60. 
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The final task of the speakerphone algorithm 41 is the digital adjustment of the microphone 
and speaker gains as directed by the speakerphone state machine 60. The inputs to a gain adjustment 
routine 230 are the volume levels in the RX and TX speech paths (RX_Max and TX_Max) and the 
current state. The portable handset 16 implements fixed gain settings in the idle state 68 and in the 
5 TX state 64, but Automatic Gain Control (AGC) is implemented in software in the RX state 62 and in 
the full-duplex state 66. In all cases, the gain adjustment routine 230 selects the optimum gain 
settings for the present volume levels, based on a pre-defined correspondence of volume levels and gain 
settings, i.e.: for any given combination of RX and TX volume levels, the optimum gain setting was 
experimentally determined during the development of the wireless speakerphone system 10. Thus 

10 there is no guesswork or adapting process in the gain adjustment routine 230; it simply outputs one 
gain setting for the current volume inputs it receives during each time it is executed. 

The same inputs (RX_Max and TX_Max) are given to the speakerphone state machine 60 as 
to the gain adjustment routine 230. These inputs are copied from RX_Peak and TX_Peak prior to the 
execution of the speakerphone state machine 60, and thus remain constant long after the end of the 

15 gain adjustment routine 230. In addition, a routine RX AGC 231 requires the RX volume level to be 
quantified into several volume ranges, so this quantification is done by the RX quantification routine 
190, just after RX_Max and TX_Max are updated, and just before the speakerphone state machine 60 
begins. 

All gain adjustments are accomplished simply by writing a gain coefficient value to the 
20 appropriate gain control register inside the controller chip 14. There are three such registers used by 
the speakerphone algorithm 41: the RX attenuation register 52; the TX attenuation register 54; and 
the microphone pre-amplification register 46. 

Referring now to Fig. 9a, the gain adjustment routine 230, entered at step 223, does only one 
of four possible things, depending on the next speakerphone state: 

25 

a) If the speakerphone state machine 60 just decided in step 224 that the next state to be 
entered is the idle state 68, then in step 225 the gain adjustment routine 230 sets 
pre-determined speaker and microphone gains by simply writing the appropriate gain 
coefficient value to the gain control registers. 

30 

b) If the speakerphone state machine 60 just decided in step 226 that the next state to be 
entered is the TX state 64, then in step 227 the gain adjustment routine 230 sets 
pre-determined speaker and microphone gains by simply writing the appropriate gain 
coefficient value to the gain control registers. 

35 

c) If the speakerphone state machine 60 decides in step 228 that the next state to be entered 
is the RX state 62, then in step 232 the gain adjustment routine 230 executes the RX AGC 231 
(shown in Fig. 9b). (Note that the RX quantization routine 190 has already been executed by 
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this time). The RX quantification routine 190 stored the result of its RX volume comparisons 
in the RAM variable RX_Level, for later use by the RX AGC 231. The RX AGC 231 uses the 
output of the RX quantification routine 190 (RX_Level) in order to decide the optimal gain 
coefficients to be written to the gain control registers. 

5 

d) If in step 228 the speakerphone state machine 60 just decided that the next state to be 
entered is the full-duplex state 66, the gain adjustment routine 230 executes the full-duplex 
AGC routine, entered at step 260. The full-duplex AGC routine is shown in Fig. 11. Step 274 
in Fig. 11 causes immediate execution of the three AGC substates shown in Figures 13a and 
10 13b: 1) if the substate = 40/60, execution begins at step 280 in Fig. 13a; or 

2) if the substate = 60/40, execution begins at step 282 in Fig. 13a; or 

3) if the substate = 50/50, execution begins at step 302 in Fig. 13b. 

When the idle state 68 is entered, the gain adjustment routine 230 sets the RX gain to -12dB, 
sets the TX gain to -9dB, and sets the Pre-Amp gain to +18dB. These gain settings are accomplished 
15 when the uP 38 writes the following coefficients to the gain control registers: 

Gain Control Register Value Written Gain Setting 

RX Attenuation 52 20h -12dB 

TX Attenuation 54 2D H -9dB 

20 Mic. Pre-Amp. 46 90h +18dB 

In the RX state 62, which is a half-duplex state, the speaker gain is controlled in software by 
Automatic Gain Control (AGC), and the microphone gain is significantly reduced. Here, the AGC 
keeps the RX volume level as loud as possible within the practical constraints imposed by the portable 
25 handset 16. 

Referring now to Fig. 8, the purpose of an RX quantification routine 190 is to quantify the RX 
volume level into one of six volume ranges, and to pass this information to the gain adjustment routine 
230 for use by the RX AGC 231. This quantification is accomplished by comparing the RX volume level 
(RX_Max) to pre-defined volume levels (steps 194 to 202), and storing the result in the RAM variable 
30 RX_Level, as shown below. RX_Max has a maximum range of 00h to 7F H . 



RXJMax Volume Range Value stored in RX_Level Step 

73 H to 7F H 5 204 

63 H to 7^ 4 206 

35 53 H to 6^ 3 208 

43 H to 52a 2 210 

33 H to42a 1 212 

00 H to 32h 0 214 
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The RX quantification routine 190 also adjusts the value of BothJThresh according to the RX 
volume level, in steps 216, 218, and 220. Both_Thresh represents the amount of energy that the TX 
volume must exceed the RX volume in order to exit the RX state 62 and enter the full-duplex state 66, 
i.e., how loud the near-end person must speak in order for the speakerphone algorithm 41 to recognize 
5 that both people are simultaneously speaking. The reason why BothJThresh is dynamic rather than 
a fixed value is directly related to the RX AGC 231. At low RX volume levels, the RX AGC 231 will 
boost the RX signal as much as possible to allow the near-end person to hear better. Due to acoustic 
coupling, this amplification results in a strong signal at the microphone, so some means is needed to 
prevent the speakerphone algorithm 41 from misinterpreting this strong microphone signal as near- 

10 end speech. Conversely, at high RX volume levels, the RX AGC 231 does not need to add much 
amplification in order for the signal to be heard well. In this case it would be undesirable to require 
the near-end person to shout at the speakerphone in order to enter the full-duplex state 66, so some 
reasonable means is need to allow this state transition to occur. Therefore Both_Thresh is 
implemented to create a "cushion" of volume to determine when the near-end person is speaking 

15 simultaneously with the far-end person. At low RX volume levels, Both_Thresh is large enough to 
mask acoustic coupling, and at high RX volume levels, Both_Thresh is small enough to allow a 
reasonable TX volume to cause the transition to the full-duplex state 66. 

Referring now to Fig. 9b, when the gain adjustment routine 230 is executed, the RX AGC 231 
dramatically reduces the TX Speech Path gain by setting the Pre- Amp gain to +3dB, sets the TX gain 

20 to -9dB, and controls the RX gain in software by the. AGC. The RX quantification routine 190 has 
already updated RX_Level with the volume range of RX_Max, so the AGC is simply a matter of 
selecting higher gain for weak RX signals and lower gain for strong RX signals, as shown below. 



25 



Gain Control Register 
Mic. Pre -Amp. 46 
TX Attenuation 54 



Value Written . 

80h 
2D H 



Gain Setting 
+3dB 
-9dB 



Referring again to Fig. 9b, the steps of the RX AGC 231 are shown below in the far right column. 



30 



35 



RX_Level 

0,1,2,3 
4 
5 



Gain Control 
Register 

RX Attenuation 52 
RX Attenuation 52 
RX Attenuation 52 



Value 
Written 

7F H 
5B H 
2D H 



Gain 
Setting 

OdB 

-3dB 

-9dB 



Steps 

234, 238, 242 
234, 238, 240 
234, 236 



Due to the particular microphone 42 (Tram Model No. TR-50, manufactured by Tram 
Electronics, Inc., Cookstown, NJ 08511) used in the portable handset 16, the external amplifier 48, and 
the physical location in the portable handset 16, it is not necessary to graduate the TX volume level 
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into ranges by a TX quantification routine. The center of the microphone (3/8" diameter) is 6 1/8" 
inches from the center of the speaker (2" diameter). Both components are oriented facing the same 
direction (up, when the speakerphone is placed flat on a desktop). Here, the "acceptable" background 
noise level limits the maximum microphone gain applied, with the result that a single digital gain 
5 setting is sufficient for the entire range of TX volume levels. 

When the TX state 64 is entered, the gain adjustment routine 230 reduces the RX gain to - 
24dB, boosts the TX gain to OdB, and sets the Pre-Amp gain to +18dB. These gain settings are 
accomplished when the pP 38 writes the following coefficients to the gain control registers. 

10 Gain Control Register Value Written Gain Setting 

RX Attenuation 52 08 H -24dB 

TX Attenuation 54 7F H OdB 

Mic. Pre-Amp. 46 90 H +18dB 

15 In the Full-Duplex State, software Automatic Gain Control (AGO regulates the gain 

proportions of the microphone and speaker amplifiers 54, 52 to keep the volume high in both speech 
paths without producing unstable audio feedback. By constantly monitoring the volume levels in the 
speech paths (as reflected in RX_Max and TX_Max), and by constantly adjusting the amplifier gains 
accordingly, the AGC dynamically regulates the: balance of the two speech paths to allow both people 

20 to speak and hear simultaneously. 

Referring now to Fig. 10, the full-duplex AGC is implemented as three substates within the 
full-duplex state 66, where each substate corresponds to a different gain combination. These substates 
are named "40/60", "50/50" and "60/40" to reflect the percentage ratio of RX-to-TX gain. The 40/60 
substate's gain setting emphasizes the microphone (for weak near-end speech), the 50/50 Substate's 

25 gain setting handles the case when strong volumes are present in both speech paths, and the 60/40 
Substate's gain setting emphasizes the speaker (for weak far-end speech). The same pre-amp gain is 
programmed for each of the substates within the full-duplex state 66. 

Referring now to Fig. 11, during each execution pass of the speakerphone state machine 60, 
if the full-duplex state 66 is entered, in step 260, only one of the three AGC substates is entered. In 

30 step 262, if the previous state was the RX state 62, then in step 264 the pre-amp gain is boosted to 
+18dB, and then in step 266 the substate is initialized to the 60/40 substate. In step 268, if the 
previous state was the idle state 68, or in step 270, if the previous state was the TX state 64, then in 
step 272 the substate is initialized to the 40/60 substate. If the previous state was the full-duplex 
state 66, then in step 274 the substate is unchanged to retain the previous AGC substate. 

35 During the current cycle, a substate machine decides which one of the three substates will be 

entered when the current cycle completes. This decision is based on the RX and TX volume levels 
(RX_Max and TX__Max), and on the current substate. The substate machine compares RX_Max and 
TX_Max to pre-defined value ranges to determine the optimal gains settings for the present volume 
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levels. These value ranges were experimentally determined during the development of the wireless 
speakerphone system 10, and resulted in the creation of a mapping of RX and TX volume levels to 
optimal gain settings. The mapping for the wireless speakerphone system 10 is shown in Fig. 12. The 
key to the stability of the AGC is to include hysteresis in the range boundaries, to avoid the metastable 
condition of the AGC oscillating between two substates in successive execution passes of the 
speakerphone state machine 60. The mapping shown in Fig. 12 for the wireless speakerphone system 
10 includes this type of hysteresis. Two examples are given next to explain how hysteresis is 
implemented. 

Example 1: 

In the 40/60 and 60/40 substates, TX_Max must: rise above 7D H and RX_Max must rise above 
6C H in order to cause a substate transition to the 50/50 substate. The AGC remains in the 
50/50 substate until TX_Max falls below 71 H and until RX_Max falls below 69h, thus allowing 
OCh of TX hysteresis and 03« of RX hysteresis between the 50/50 substate and the 40/60, 60/40 
substates. 

Example 2: 

Because the 40/60 and 60/40 substates share the same mapping, it may appear at first that 
there is no hysteresis between these two substates. However, an understanding of the 
speakerphone system explains the hysteresis.- Increasing the RX gain increases the value of 
TX_Max due to acoustic coupling of the speaker to the microphone, which results in an 
apparent increase in TX speech volume from the point of view of the 38. In the 40/60 
substate the RX gain is 40% of the combined RX and TX gains. If the transmit speech gets 
loud enough to cause TX_Max to rise above 6E H , the AGC initiates a substate transition to the 
60/40 substate, and thus decreases the TX gain in order to compensate for the apparent 
increased TX speech volume. The transition also increases the RX gain from 40% to 60%, 
which from the point of view of the yiP 38, increases the apparent TX volume as indicated by 
TX_Max. In the portable handset 16, this increase is generally 03 H to 04 , which means 
TX_Max is now 71 H to 72h. The AGC remains in this 60/40 substate until TX_Max falls back 
below 6E H , thus allowing more than 03a of hysteresis between the 40/60 and 60/40 substates. 

Thus by determining which range the RX and TX volume levels fall into, the AGC decides 
which next substate (and consequently which pre-defined gain setting) is best for the associated input 
volume levels. 

Figs. 13 and 14 show the decision process which implements the mapping of RX and TX volume 
levels to optimal gain settings, and defines the next substate. Referring now to Fig. 13a, the pP 38 
begins at either step 280 or step 282, depending on the substate decided by the full-duplex AGC in Fig. 
11. In step 284, the AGC determines if TX_Max is greater than 7D H . If it is not, then in step 286 the 
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AGC determines if TX_Max is less than 6EH. If it is, then in step 288 the RX gain is set equal to - 
12dB, and the TX gain is set equal to -9dB. In step 290, the substate machine will enter the 40/60 
substate when the current cycle completes. In step 286, if TX_Max is not less than 6EH, then in step 
292 the RX gain is set equal to -9dB, and the TX gain is set equal to -12dB. In step 294, the substate 
5 machine will enter the 60/40 substate when the current cycle completes. Referring back to step 284, 
if TX_Max is greater than 7D H , then in step 296 the AGC determines if RX_Max is greater than 6CH. 
If it is not, then in step 292 the RX gain is set equal to -9dB, and the TX gain is set equal to -12dB. 
In step 294, the substate machine will enter the 60/40 substate when the current cycle completes. 
Referring back to step 296, if RX_Max is greater than than 6CH, then in step 298 the RX gain is set 

10 equal to -12dB, and the TX gain is set equal to -12dB. In step 300, the substate machine will enter 
the 50/50 substate when the current cycle completes. 

Referring now to Fig. 13b, in step 302 the pP 38 begins at step 302 if the substate decided by 
the full-duplex AGC in Fig. 11 is the 50/50 substate. In step 304, the AGC determines if TX_Max is 
less than 68H. If it is, then in step 306 the RX gain is set equal to -12dB, and the TX gain is set equal 

15 to -9dB. In step 308, the substate machine will enter the 40/60 substate when the current cycle 
completes. Referring back to step 304, if TX_Max is not less than 68H, then in step 310 the AGC 
determines if RX_Max is less than 5EH. If it is, then in step 312 the RX gain is set equal to -9dB, and 
the TX gain is set equal to -12dB. In step 314, the substate machine will enter the 60/40 substate 
when the current cycle completes.. Referring back to step 310, if RXJMax is not less than 5EH, then 

20 in step 316 the AGC determines if RX_Max is less, than 69H. In step 318, the substate machine 
remains in the 50/50 substate until the next cycle. If RX_Max is less than 69H, then in step 320 the 
AGC determines if TX_Max is greater than 71H. If it is, then in step 318, the substate machine 
remains in the 50/50 substate until the next cycle. If it is not, then in step 312, the RX gain is set 
equal to -9dB, and the TX gain is set equal to -12dB. In step 314, the substate machine will enter the 

25 60/40 substate when the current cycle completes. 

In addition to the preferred embodiment of a speakerphone in the portable handset 16, the 
same speakerphone algorithm 41 can be implemented in at least two other embodiments. In typical 
speakerphone configurations available on the market today, the half-duplex speakerphone function is 
implemented in the base station 18 rather than in the portable handset 16. Referring now to Fig. 14, 

30 in a first alternate embodiment of the invention, a wireless speakerphone system 400 has a second 
codec 33 in the base station 18. The same full-duplex speakerphone algorithm 41 can be executed by 
the base station's pP 39. In the alternate embodiment of Fig. 14, the two hardware requirements are 
maintained: 

the RX and TX volume levels are provided to the pP 39 in digital form; and 
35 the pP 39 can control the gain in the RX and TX speech paths. 

The portable handset 16 also includes a typical earphone/microphone 17. 

To further generalize, the same full-duplex speakerphone algorithm 41 can be executed by the 
pP 38 in any telephone system where the following two hardware requirements are maintained: 
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the RX and TX volume levels are provided to the uP 38 in digital form; and 
the uP 38 can control the gain in the RX and TX Speech paths. 

Fig. 15 shows such a second alternate embodiment of the invention. A wireless speakerphone 
system 500 includes analog- to-digital converters 502 and 504. The converter 502 sends a digital voice 
signal to a receive speech register 506, which has an 8kHz sample rate. The converter 504 sends a 
digital voice signal to a transmit speech register 508, which also has an 8kHz sample rate. A 
microprocessor pP 510 controls the functioning of the system 500, under the direction of the 
speakerphone algorithm 41 stored in a ROM 512. Amplifiers 514 and 516 amplify the voice signals 
from a microphone 518, and to a speaker 520. The amplifiers 514, 516 can be analog or digital as long 
as their gain is selectable by means of a uP data bus 522. 

The present invention has many advantages over the prior art. It provides better sound 
quality than the typical half-duplex speakerphone. It allows a full-duplex conversation, i.e.: 
simultaneous speaking and hearing. 

The present invention eliminates the need for an external analog speakerphone chip and a DSP 
engine, and instead performs the comparisons, decisions, and gain adjustments by a small on-chip uP 
38. The on-chip hands-free registers 34, 36 eliminate the need for a costly external speakerphone chip 
to implement the analog volume comparisons and make the speaker/microphone gain decisions. 

Software-programmable digital gains are provided on-chip, thus eliminating the need for 
expensive external analog decoders with resistor ladders. The user-controlled volume setting is 
. typically implemented in hardware, in an. analog speakerphone via a potentiometer, but suffers from 
degraded audio quality over time due to dust in the potentiometer mechanism and DC offset drift. The 
speakerphone handles volume control via software. 

The present invention eliminates the need for a second codec in the base station (one codec is 
needed for the analog interface to the telephone* line, and the second codec would be needed to handle 
the analog speakerphone interface). In the present invention, the second codec is provided in the 
portable handset 16 where it was already needed to complete the digital wireless voice connection. 

The user enjoys the obvious freedom of wires to the speakerphone enclosure, for example, when 
the wireless speakerphone is used in a large conference room, and when the same speakerphone is 
used in one of several different conference rooms. 

Multiple variations and modifications are possible in the embodiments of the invention 
described here. Although certain illustrative embodiments of the invention have been shown and 
described here, a wide range of modifications, changes, and substitutions is contemplated in the 
foregoing disclosure. In some instances, some features of the present invention may be employed 
without a corresponding use of the other features. For example, the algorithm 41 performs reasonably 
well if the following times are used: 

1. 125us Timer increased to 1ms or 2ms 

2. 20ms Timer increased to 80ms 

3. 160ms Hold Time + 80ms Duplex Time increased up to about 1 second 



- 21 - 




TT1220 

4. Peak Detection Window reduced from 80 samples to 40 samples 

It is possible to decrease rather than increase these times and achieve the same performance, 
but it would result in a higher power consumption in the portable handset 16, and therefore would 
reduce the user's maximum "talk time", because it would drain the battery faster. On the other hand, 
if one increases the timing, he would thus be increasing the "talk time" by reducing the portable 
handset's power consumption. 

A slight change in the speakerphone algorithm 41 will allow the user to adjust the "background 
noise level thresholds' 1 slightly, rather than having them fixed as they are in the preferred 
embodiment. For example, the background noise level is "pre-defined" for a typical quiet engineering 
office. On the other hand, for a typical noisy office at a stock brokerage, the user can press a near-end 
noise button on the handset 16 (not shown) a few times to adjust this background noise level (in small 
steps) for the cases when the noise level around him is higher. Likewise, if he hears that the noise 
level from the far-end is high, he can press a far-end noise button on the handset 16 (not shown) a few 
times to adjust the far-end background noise level threshold. The result is that the speakerphone 
would sound better. 

With the pre-defined noise thresholds currently in the algorithm 41, if someone calls the user 
from an excessively noisy place (maybe a gym), the high background noise will make the speakerphone 
algorithm 41 think (falsely) that the far-end person is constantly talking, and consequently it would 
remain in the RX state 62 when neither person is talking (instead of in the Idle state 68). By making 
a slight modification to the speakerphone algorithm 41, to .allow the noise thresholds to be variable 
instead of fixed, the speakerphone algorithm 41 would yield better performance in some cases. 
However, the preferred embodiment enjoys simplicity of design and operation, and thus does not 
implement the variable noise thresholds. The algorithm 41 simply compares the current volume levels 
to noise thresholds. The* algorithm 41 doesn't care if the threshold is fixed or variable, it just needs 
to know what is the threshold at the time it makes the comparison. 

Accordingly, it is appropriate that the foregoing description be construed broadly and 
understood as being given by way of illustration and example only, the spirit and scope of the invention 
being limited only by the appended claims. 
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